Channel: DeepMind
Category: Science & Technology
Description: Research Scientist Hado van Hasselt discusses multi-step and off policy algorithms, including various techniques for variance reduction. Slides: dpmd.ai/offpolicy Full video lecture series: dpmd.ai/DeepMindxUCL21